SEQUENCE LISTING
(1) GENERAL INFORMATION:

(i) APPLICANT: SHIONOGI SEIYAKU KABUSHIKI KAISHA

(ii) TITLE OF INVENTION: A human endothelin receptor

(iii) NUMBER OF SEQUENCES: 2

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: SHIONOGI SEIYAKU KABUSHIKI KAISHA

(B) STREET: 3-1-8, Dosho-machi

(C) CITY: Chuo-ku 

(D) STATE: Osaka
(E) COUNTRY: Japan
(F) ZIP: 541



(vii) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.50 inch, 1024 Kb storage

(B) COMPUTER: EPSON

(C) OPERATING SYSTEM: MS-DOS ver. 2.11

(D) SOFTWARE: Wordstar ver. 5.0

(viii) PRIOR APPLICATION DATA:

(A) APPLICATION NUMBER: JP 3-172828

(B) FILING DATE: 12-July-1991



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4105 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: human

(vii) DIRECT ORIGINAL SOURCE: 

(B) CLONE: phET: 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 485..1768

(C) IDENTIFICATION METHOD: by experiment

(A) NAME/KEY: signal peptide

(B) LOCATION: 485..544

(C) IDENTIFICATION METHOD: similarity to other signal sequences

(A) NAME/KEY: mat peptide

(B) LOCATION: 545..1768

(C) IDENTIFICATION METHOD: similarity to other signal sequences

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



GAATTCGCGG CCGCCTCTTG CGGTCCCAGA GTGGAGTGGA AGGTCTGGAG CTTTGGGAGG 60

AGACGGGGAG GACAGACTGG KGGCGTGTTC CTCCGGAGTT TTCTTTTTCG TGCGAGCCCT 120

CGCGCGCGCG TACAGTCATC CCGCTGGTCT GACGATTGTG GAGAGGCGGT GGAGAGGCTT 180



AATCCATCCC ACCCGGTCGT CGCCGGGGAT TGGGGTCCCA GCGACACCTC CCCGGGAGAA 240

CCAGTGCCCA GGAAGTTTTC TGAAGTCGGG GAAGCTGTGC AGCCGAAGCC GCCGCCGCGC 300

(j :ggagcccgg GACACCGGCC ACCCTCCGCG ccacccaccc TCGCTTTCTC CGGCTTCCTC 360 

DGGCCCAGGC GCCGCGCGGA CCCGGCAG&T GTCTGCGCAC GCCGAGCTCC ACGGTGAAAA 420 



(Iaaaagtgaa GGTGTAAAAG CAGCACAAGT GCAATAAGAG ATATTTCCTC AAATTTGCCT 480

[Jaag ATG GAA ACC CTT TGC CTC AGG GCA TCC TTT TGG CTG GCA CTG GTT 529

Met Glu Thr Leu Cys Leu Arg Ala Ser Phe Trp Leu Ala Leu Val
-20 -15 -10

GGA TGT GTA ATC AGT GAT AAT CCT GAG AGA TAC AGC ACA AAT CTA AGC 577

Gly Cys Val Ile Ser Asp Asn Pro Glu Arg Tyr Ser Thr Asn Leu Ser
-5 -1 1 5 10

AAT CAT GTG GAT GAT TTC ACC ACT TTT CGT GGC ACA GAG CTC AGC TTC 625

Asn His Val Asp Asp Phe Thr Thr Phe Arg Gly Thr Glu Leu Ser Phe
15 20 25

CTG GTT ACC ACT CAT CAA CCC ACT AAT TTG GTC (3|TA CCC AGC AAT GGC 673

Leu Val Thr Thr His Gln Pro Thr Asn Leu Val Leu Pro Ser Asn Gly
30 35 40

TCA ATG CAC AAC TAT TGC CCA CAG CAG ACT AAA ATT ACT TCA GCT TTC 721

Ser Met His Asn Tyr Cys Pro Gln Gln Thr Lys Ile Thr Ser Ala Phe
45 50 55

AAA TAC ATT AAC ACT GTG ATA TCT TGT ACT ATT TTC ATC GTG GGA ATG 769

Lys Tyr Ile Asn Thr Val Ile Ser Cys Thr Ile Phe Ile Val Gly Met
60 65 70 75

«5' GGG AAT GCA ACT CTG CTC AGG ATC ATT TAC CAG AAC AAA TGT ATG 817

Val Gly Asn Ala Thr Leu Leu Arg Ile Ile Tyr Gln Asn Lys Cys Met
80 85 90

iGG AAT GGC CCC AAC GCG CTG ATA GCC AGT CTT GCC CTT GGA GAC CTT 865

Arg Asn Gly Pro Asn Ala Leu Ile Ala Ser Leu Ala Leu Gly Asp Leu
95 100 105



ATC TAT GTG GTC ATT GAT CTC CCT ATC AAT GTA TTT AAG CTG CTG GCT 913

Ile Tyr Val Val Ile Asp Leu Pro Ile Asn Val Phe Lys Leu Leu Ala
110 115 120

GGG CGC TGG CCT TTT GAT CAC AAT GAC TTT GGC GTA TTT CTT TGC AAG 961

Gly Arg Trp Pro Phe Asp His Asn Asp Phe Gly Val Phe Leu Cys Lys
125 130 135

CTG TTC CCC TTT TTG CAG AAG TCC TCG GTG GGG ATC ACC GTC CTC AAC 1009

Leu Phe Pro Phe Leu Gln Lys Ser Ser Val Gly Ile Thr Val Leu Asn
140 145 150 155

CTC TGC GCT CTT AGT GTT GAC AGG TAC AGA GCA GTT GCC TCC TGG AGT 1057

Leu Cys Ala Leu Ser Val Asp Arg Tyr Arg Ala Val Ala Ser Trp Ser
160 165 170

CGT GTT CAG GGA ATT GGG ATT CCT TTG GTA ACT GCC ATT GAA ATT GTC 1105

Arg Val Gln Gly Ile Gly Ile Pro Leu Val Thr Ala Ile Glu Ile Val
175 180 185



TCC ATC TGG ATC CTG TCC TTT ATC CTG GCC ATT CCT GAA GCG ATT GGC 1153

Ser Ile Trp Ile Leu Ser Phe Ile Leu Ala Ile Pro Glu Ala Ile Gly
190 195 200



■?TG/ GTC ATG GTA CCC TTT GAA TAT AGG GGT GAA CAG CAT AAA ACC TGT 1201

fisBd Val Met Val Pro Phe Glu Tyr Arg Gly Glu Gln His Lys Thr Cys
205 210 215



ATG CTC AAT GCC ACA TCA AAA TTC ATG GAG TTC TAC CAA GAT GTA AAG 1249

Met Leu Asn Ala Thr Ser Lys Phe Met Glu Phe Tyr Gln Asp Val Lys
220 225 230 235



GAC TGG TGG CTC TTC GGG TTC TAT TTC TGT ATG CCC TTG GTG TGC ACT 1297

Asp Trp Trp Leu Phe Gly Phe Tyr Phe Cys Met Pro Leu Val Cys Thr
240 245 250



GCffi ATC TTC TAC ACC CTC ATG ACT TGT GAG ATG TTG AAC AGA AGG AAT 1345

Ala Ile Phe Tyr Thr Leu Met Thr Cys Glu Met Leu Asn Arg Arg Asn
255 260 265

GGG AGC TTG AGA ATT GCC CTC AGT GAA CAT CTT AAG CAG CGT CGA GAA 1393

Gly Ser Leu Arg Ile Ala Leu Ser Glu His Leu Lys Gln Arg Arg Glu
270 275 280



GTG GCA AAA ACA GTT TTC TGC TTG GTT GTA ATT TTT GCT CTT TGC TGG 1441

Val Ala Lys Thr Val Phe Cys Leu Val Val Ile Phe Ala Leu Cys Trp
285 290 295

TTC CCT CTT CAC TTA AGC CGT ATA TTG AAG AAA ACT GTG TAT AAC GAA 1489

Phe Pro Leu His Leu Ser Arg Ile Leu Lys Lys Thr Val Tyr Asn Glu
300 305 310 315

GAC AAG AAC CGA TGT GAA TTA CTT AGT TTC TTA CTG CTC ATG GAT 1537

ft Asp Lys Asn Arg Cys Glu Leu Leu Ser Phe Leu Leu Leu Met Asp
320 325 330



TAC ATC GGT ATT AAC TTG GCA ACC ATG AAT TCA TGT ATA AAC CCC ATA 1585

Tyr Ile Gly Ile Asn Leu Ala Thr Met Asn Ser Cys Ile Asn Pro Ile
335 340 345

GCT CTG TAT TTT GTG AGC AAG AAA TTT AAA AAT TGT TTC CAG TCA TGC 1633

Ala Leu Tyr Phe Val Ser Lys Lys Phe Lys Asn Cys Phe Gln Ser Cys
350 355 360

CTC TGC TGC TGC TGT TAC CAG TCC AAA AGT CTd ATG ACC TCG GTC CCC 1681

Leu Cys Cys Cys Cys Tyr Gln Ser Lys Ser Leu Met Thr Ser Val Pro
365 370 375

ATG aW GGA ACA AGC ATC CAG TGG AAG AAC CAC GAT CAA AAC AAC CAC 1729

Met Asn Gly Thr Ser Ile Gln Trp Lys Asn His Asp Gln Asn Asn His
380 385 390 395



AAC ACA GAC CGG AGC AGC CAT AAG GAC AGC ATG AAC TGACCACCCT 1775

Asn Thr Asp Arg Ser Ser His Lys Asp Ser Met Asn
400 405



f^GAACCACT CCTCGGTACT CCCATAATCC VcTCGGAGAA AAAAATGACA AGGCAACTGT 1835 

dRcTCCGGGA ATCTCTTCTC TGATCCTTCT TGCTTAATTC ACTCCCACAC CCAAGAAGAA 1895 


AtGCTTTCCA AAACCGCAAG GTAGACTGGT TTAVrCCACCC ACAACATCTA CGAATCGTAC 1955 

TTCTTTAATT GATCTAATTT ACATATTCTG CGTGlfTGTAT TCAGCACTAA AAAATGGTGG 2015 


(ftcClGGGGG AGAATGAAGA CTGTTAAATG AAACCKGAAG GATATTTACT ACTTTTGCAT 2075 

GAAA/TAGAG CTTTCAAGTA CATGGCTAGC TTTTATbGCA GTTCTGGTGA ATGTTCAATG 2135 

GGAACTGGTC ACCATGAAAC TTTAGAGATT AACGACAAGA TTTTCTACTT TTTTTAAGTG 2195

ATTTTTTGTC CTTCAGCCAA ACACAATATG GGCTCAGGTC ACTTTTATTT GAAATGTCAT 2255

TTGGTGCCAG TATTTTTTAA CTGCATAATA GCCTAACATG ATTATTTGAA CTTATTTACA 2315

catag4ttga AAAAAAAAAG ACAAAAATAG TATTCAGGTG VAGGAATTAGA TT^AGTATTTT 2375 



CCACGTCACT ATTTATTTTT TTAAAACACA AATTCTAAAG OTACAACAAA TACTACAGGC 2435
CCTTAAAGCA CAGTCTGATG ACACATTTGG CAGTTTAATA GaVgtTACTC AAAGAATTTT 2495 



TTAAGAAQTG. TATTTTATTT TTTAAATGGT GTTTTATTAC AAGGGACCTT GAACATGTTT 2555 
TGTATGTTAA ATTCAAAAGT AATfeCTTCAA TCAGATAGTT CTTTTTCACA AGTTCAATAC 2615

TGT' 'TTTCAT GTAAATTTTG TATGaXaAAT CAATGTCAAG TACCAAAATG TTAATGTATG 2675 

TGTGATTTAA CTCTGCCTGA GACTTTC«GT GCACTGTATA TAGAAGTCTA AAACACACCT 2735 

AAGAGAAAAA GATCGAATTT TTCAGATGATJ TCGGAAATTT TCATTCAGGT ATTTGTAATA 2795 

GTGACATATA TATGTATATA CATATCACCT VcTATTCTCT TAATTTTTGT TAAAATGTTA 2855

JSbTGGCAGTA AGTCTTTTTT GATCATTCCC TTTTCCATAT AGGAAAGATA ATTTTGAAGT 2915 



^SCCAGATGA GTTTATCATG TCAGTGAAAA ATaATTACCC ACAAATGCCA CCAGTAACTT 2975

Kacgattctt CACTTCTTGG GGTTTTCAGT ATGAACCTAA CTCCCCACCC CAACATCTCC 3035

CS'CCCACATT GTCACCATTT CAAAGGGCCC ACAGTGACTT TTGCTGGGCA TTTTCCCAGA 3095

TGT



IJIsT TTACAGA CTGTGAGTAC AGCAGAAAAT CTTTTACTAG TGTGTGTGTG TATATATATA 3155



AACAATTGTA AATTTCTTTT AGCCCATTTT TCTAGAffiTGT CTCTGTGGAA TATATTTGTG 3215 



GTGATAT ATGCATGTGT GTGATGGTAT GTATGGATTT AATCTAATQT AATAATTGTG 3275 



CCCCGCAGTT GTGCCAAAGT GCATAGTCTG AGCTAAAATC TAGGTGATTG TTCATCATGA 3335 
CAACCTGCCT CAGTCCATTT TAACCTGTAG CAACCTTCTffi CATTCATAAA TCTTGTAATC 3395
ATGTTACCAT TACAAATGGG ATATAAGAGG CAGCGTGAAA GCAGATGAGC TQTGGACTAG 3455



CAATATAGGG TTTTGTTTGG TTGGTTGGTT TGATAAAGCA fflTATTTGGGG TCATATTGTT 3515 



TCGTGTGCTG GAGCAAAAGT CATTACACTT TGAAGTATTA TATTGTTCTT ATCCTCAATT 3575



CAATGTGGTG ATGAAATTGC GAGGTTGTCT GATATTTCTT TCAGACTTCG CCAGACAGAT 3635
TGCTGATAAT AAATTAGGTA AGATAATTTG TTGGGCCATA TTTTAGGACA GGTAAAATAA 3695



CATCAGGTTC CAGTTGCTTG AATTGCAAGG CTAAGAAGTA CTGCCCTTTT GTGTGTTAGC 3755 



AGTCAAATCT ATTATTCCAC \?GGCGCATCA TATGCAGTGA TATATGCCTA TAATATAAGC 3815 
CATAGGTTCA CACCATTTTG TTTAGACAAT TGTCTTTTTT TCAAGATGCT TTGTTTCTTT 3875 



CATAT AAAA AAATGCATTT TATAAATTCA GAAAGTCATA GATTTCTGAA GGCGTCAACG 3935

£GCAT|rTTAT TTATGGACTG GTAAGT^ACT GTGGTTTACT AGCAGGAATA TTTCCAATTT 3995
■CTACCTTTAC TACATCTTTT CAACAAGT&A CTTTGTAGAA ATGAGCCAGA AGCCAAGGCC 4055

^TGAGTTGGC AGTGGCCCAT AAGTGTAAaA TAAAAGTTTA CAGAAACCTT 4105



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4301 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: human

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 238..1566

(C) IDENTIFICATION METHOD: by experiment

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:



GAGACATTCC GGTGGGGGAC TlCTGGCCAGC CCGAGCAACG TGGATCCTGA GAGCACTCCC 60
AGGTAGGCAT TTGCCCCGGT GGgVcGCCTT GCCAGAGCAG TGTGTGGCAG GCCCCCGTGG 120
AttGATCAACA CAGTGGCTGA ACACTGGGAA GGAACTGGTA CTTGGAGTCT GGACATCTGA 180
i/acttggctc TGAAACTGCG GAGCGGCCAC CGGACGCCTT CTGGAGCAGG TAGCAGC 237



ATG CAG CCG CCT CCA AGT CTG TGC GGA CGC GCC CTG GTT GCG CTG GTT 285

Met Gln Pro Pro Pro Ser Leu Cys Gly Arg Ala Leu Val Ala Leu Val
1 5 10 15

CTT GCC TGC GGC CTG TCG CGG ATC TGG GGA GAG GAG AGA GGC TTC CCG 333

Leu Ala Cys Gly Leu Ser Arg Ile Trp Gly Glu Glu Arg Gly Phe Pro
20 25 30

CCT GAC AGG GCC ACT CCG CTT TTG cA ACC GCA GAG ATA ATG ACG CCA 381

Pro Asp Arg Ala Thr Pro Leu Leu Gln Thr Ala Glu Ile Met Thr Pro
35 40 45

CCC ACT AAG ACC TTA TGG CCC AAG GGT TCC AAC GCC AGT CTG GCG CGG 429

Pro Thr Lys Thr Leu Trp Pro Lys Gly Ser Asn Ala Ser Leu Ala Arg
50 55 60



TCG TTG GCA CCT GCG GAG GTG CCT AAA GGA GAC AGG ACG GCA GGA TCT 477

Ser Leu Ala Pro Ala Glu Val Pro Lys Gly Asp Arg Thr Ala Gly Ser
65 70 75 80

CCG CCA CGC ACC ATC TCC CCT CCC CCG TGC CAA GGA CCC ATC GAG ATC 525

Pro Pro Arg Thr Ile Ser Pro Pro Pro Cys Gln Gly Pro Ile Glu Ile
85 90 95



AAG GAG ACT TTC Aa\ TAC ATC AAC ACG GTT GTG TCC TGC CTT GTG TTC 573

Lys Glu Thr Phe Lys Tyr Ile Asn Thr Val Val Ser Cys Leu Val Phe
100 105 110

GTG QTG GGG ATC ATC GGG AAC TCC ACA CTT CTG AGA ATT ATC TAC AAG 621

Val Leu Gly Ile Ile Gly Asn Ser Thr Leu Leu Arg Ile Ile Tyr Lys
115 120 125



AAC AAG TGC ATG CGA AAC GGT CCC AAT ATC TTG ATC GCC AGC TTG GCT 669

Asn Lys Cys Met Arg Asn Gly Pro Asn Ile Leu Ile Ala Ser Leu Ala
130 135 140

£IETG GGA GAC V TG CTG CAC ATC GTC AT Y GAC ATC CCT ATC AAT GTC TAC 717

Leu Gly Asp Leu Leu His Ile Val Ile Asp Ile Pro Ile Asn Val Tyr
145 150 155 160



AAG CTG CTG GCA GAG GAC TGG CCA TTT GGA GCT GAG ATG TGT AAG CTG 765

Lys Leu Leu Ala Glu Asp Trp Pro Phe Gly Ala Glu Met Cys Lys Leu
165 170 175

GTG CCT TTZ ATA CAG AAA GCC TCC GTG GGA ATC ACT GTG CTG AGT CTA 813

Val Pro Phe Ile Gln Lys Ala Ser Val Gly Ile Thr Val Leu Ser Leu
180 185 190

TGT GCT CTG AGT ATT GAC AGA TAT CGA GCT GTT GC\ TCT TGG AGT AGA 861

Cys Ala Leu Ser Ile Asp Arg Tyr Arg Ala Val Ala Ser Trp Ser Arg
195 200 205



Al/r AAA GGA ATT GGG GTT CCA AAA TGG ACA GCA GTA GAA ATT GTT TTG 909

Ile Lys Gly Ile Gly Val Pro Lys Trp Thr Ala Val Glu Ile Val Leu
210 215 220

ATT TGG GTG GTC TCT GTG GTT CTG GCT GTC CCT GAA GCC ATA GGT TTT 957

Ile Trp Val Val Ser Val Val Leu Ala Val Pro Glu Ala Ile Gly Phe
225 230 235 240



GAT ATA ATT ACG ATG GAC TAC AAA GGA AGT TAT CTG CGA ATC TGC TTG 1005

Asp Ile Ile Thr Met Asp Tyr Lys Gly Ser Tyr Leu Arg Ile Cys Leu
245 250 255

CTT CAT CCC GTT CAG AAG ACA GCT TTC ATG CAG TTT TAC AAG ACA GCA 1053

Leu His Pro Val Gln Lys Thr Ala Phe Met Gln Phe Tyr Lys Thr Ala
260 265 270

AAA GAT TGG TGG CTG TTC AGT TTC TAT tW TGC TTG CCA TTG GCC ATC 1101

Lys Asp Trp Trp Leu Phe Ser Phe Tyr Phe Cys Leu Pro Leu Ala Ile
275 280 285

ACT GCA TTT TTT TAT ACA CTA ATG ACC TGT (Wi ATG TTG AGA AAG AAA 1149

Thr Ala Phe Phe Tyr Thr Leu Met Thr Cys Glu Met Leu Arg Lys Lys
290 295 300



AGT GGC ATG CAG ATT GCT TTA AAT GAT CAC CTA AAG CAG AGA CGG GAA 1197

Ser Gly Met Gln Ile Ala Leu Asn Asp His Leu Lys Gln Arg Arg Glu
305 310 315 320

GTG GCC AAA ACC GTC Wt TGC CTG GTC CTT GTC TTT GCC CTC TGC TGG 1245

Val Ala Lys Thr Val Phe Cys Leu Val Leu Val Phe Ala Leu Cys Trp
325 330 335

CTT CCC CTT CAC CTC AGC AG^ ATT CTG AAG CTC ACT CTT TAT AAT CAG 1293

Leu Pro Leu His Leu Ser Arg Ile Leu Lys Leu Thr Leu Tyr Asn Gln
340 345 350

AAT GAT CCC AAT AGA TGT GAA CTT TTG AGC TTT CTG TTG GTA TTG GAC 1341

Asn Asp Pro Asn Arg Cys Glu Leu Leu Ser Phe Leu Leu Val Leu Asp
355 360 365

TAT ATT GGT ATC AAC ATG GCT TCA CTG AAT TCC TGC ATT AAC CCA ATT 1389

Tyr Ile Gly Ile Asn Met Ala Ser Leu Asn Ser Cys Ile Asn Pro Ile
370 375 380

GC1 CTG TAT TTG GTG AGC AAA AGA TTC AAA AAC TGC TTT AAG TCA TGC 1437

Ala Leu Tyr Leu Val Ser Lys Arg Phe Lys Asn Cys Phe Lys Ser Cys
385 390 395 400

TTA TGC TGC TGG TGC CAG TCA TTT GAA GAA AAA CAG TCC TTG GAG GAA 1485

Leu Cys Cys Trp Cys Gln Ser Phe Glu Glu Lys Gln Ser Leu Glu Glu
405 410 415

AAG CAG TCG TGC TTA AAG TTC AAA GCT AAT GAT CAC GGA TAT GAC AAC 1533

Lys Gln Ser Cys Leu Lys Phe Lys Ala Asn Asp His Gly Tyr Asp Asn
420 425 430

TTC CGT TCC AGT AAT AAA TAC AGC TCA TCT TGAAAGAAGA ACTATTCACT 1583

Phe Arg Ser Ser Asn Lys Tyr Ser Ser Ser
435 440



GTAfTTTCATT TTCTTTATAT TGGACCGAAG TCATTAAAAC AAAATGAAAC ATTTGCCAAA 1643 
*aC«AAACAAA AAACTATGTA TTTGCACAGO, ACACTATTAA AATATTAAGT GTAATTATTT 1703 



Jta^cactcac AGCTACATAT GACATTTTAT GAGCTGTTTA CGGCATGGAA AGAAAATCAG 1763

^G^GAATTAA GAAAGCCTCG TCGTGAAAGC AC^TAATTTT TTACAGTTAG CACTTCAACA 1823

fiAbcTCTTAA CAACTTCCAG GATATTCACA CAACACTTAG GCTTAAAAAT GAGCTCACTC 1883

OaQAATTTCTA TTCTTTCTAA AAAGAGATTT ATTTTT^AAAT CAATGGGACT CTGATATAAA 1943



GpAAGAATAA GTCACTGTAA AACAGAACTT TTAAATGftAG CTTAAATTAC TCAATTTAAA 2003 

[■TTTAAAAT CCTTTAAAAC AACTTTTCAA TTAATATTAT CACACTATTA TCAGATTGTA 2063

[ttagatgca AATGAGAGAG CAGTTTAGTT GTTGCATTTT TCGGACACTG GAAACATTTA 2123

VTGATCAGG AGGGAGTAAC AGAAAGAGCA AGGCTGTTTT lJGAAAATCAT TACACTTTCA 2183 

;tagaagccc aaacctcagc attctgcaat atgtaaccaa caVgtcacaa AGAAGCAGCA 2243 

[■GTAACAGAC TGGCACATGT GCCAGCTGAA TTTAAAATAT AATACTTTTA AAAAGAAAAT 2303 

PATTACATCC TTTACATTCA GTTAAGATCA aacctcacaa agagaVatag AATGTTTGAA 2363 



AGGCTATCCC AAAAGACTTT TTTGAATCTG TCATTCACAT ACCCTGTGAA GACAATACTA 2423 

TC^ACAATTT TTTCAGGaVt ATTAAAATCT TCTTTTTTCA CTATCGTAGC TTAAACTCTG 2483 



TTTGGTTTTG TCATCTGTAA ATACTTACCT ACATACACTG CATGTAGATG ATTAAATGAG 2543



GGCAGGCCCT GTGCTCATAG CTTTACGATG GAGAGATGCC AGTGACCTCA TAATAAAGAC 2603 
TGTGAACTGC CTGGTGCAGT GTOCACATGA CAAAGGGGCA GGTAGCACCC TCTCTCACCC 2663



AT 3CTGTGGT TAAAATGGTT TCTAGCATAT GTATAATGCT ATAGTTAAAA TACTATTTTT 
P \ 

.RCAAAATCATA CAGATTAGTA CATTTAACAG CTACCTGTAA AGCTTATTAC TAATTTTTGT 



TAT 



I - 



CpTGlrACAAGAG AAAACGGAAG AGAGAGGAAA TGAGGTGGGG T TGG AGG AAA CCCATGGGGA 

□ 



2723 
2783 
2843 



'?ATTTTTG TAAATAGCCA ATAGAAAAGT TTGCTTGACA TGGTGCTTTT CTTTCATCTA 

%AGGCAAAAC tgctttttga gaccgtaagaX ACCTCTTAGC TTTGTGCGTT CCTGCCTAAT 2903 

'-■4 

Ptt+tatatct tctaagcaaa gtgccttagg Atagcttggg atgagatgtg TGTGAAAGTA 



2963 



3023 



CflGATTCCCA TTCTTAGCCT AACGTTCGTC ATTGQCTCGT CACATCAATG CAAAAGGTCC 3083 

TEATTTTGTT CCAGCAAAAC ACAGTGCAAT GTTCTCAGAG TGACTTTCGA AATAAATTGG 3143

GpCCAAGAGC TTTAACTCGG TCTTAAAATA TGCCCAAaW TTTACTTTGT TTTTCTTTTA 3203 

ATAGGCTGGG CCACATGTTG GAAATAAGCT AGTAATGTTGVTTTTCTGTCA ATATTGAATG 3263 

GATGGTACA GTAAACCAAA ACCCAACAAT GTGGCCAGAA AfSAAAGAGCA ATAATAATTA 3323

XCACACAC CATATGGATT CTATTTATAA ATCACCCACA AAcVrGTTCT TTAATTTCAT 3383 

AATCACT TTTTCAGAGG CCTGTTATCA TAGAAGTCAT TTTAGACTCT CAATTTTAAA 3443 



TTAATTTTGA ATCACTAATA TTTTCACAGT TTATTAATAT ATTTAAtVtC TATTTAAATT 3503

TTAGJATTATT TTTATTACCA >RGTACTGAAT TTTTACATCC TGATACCCTT TCCTTCTCCA 3563

TGTCEAGTATC ATGTTCTCTA ATTATCTTGC CAAATTTTGA AACTACACAC AAAAAGCATA 3623 

CTTGCATTAT TTATAATAAA ATTGCaVtCA GTGGCTTTTT AAAAAAAATG TTTGATTCAA 3683 

AACjTTTAACA TACTGATAAG TAAGAAACAA TTATAATTTC TTTACATACT CAAAACCAAG 3743 

ATAGAAAAAG GTGCTATCGT TCAACTTCAA YaCATGTTTC CTAGTATTAA GGACTTTAAT 3803 



atJagcaacag acaaaattat tgttaacatg ga\gttacag ctcaaaagat ttataaaaga 3863

: AA

UJ^TTAACCTA TTTTCTCCCT TATTATCCAC TGCTAATGTG GATGTATGTT CAAACACCTT 3923

AGTATTGA TAGCTTACAT ATGGCCAAAG GAATAC^GTT TATAGCAAAA CATGGGTATG 3983

GTAGCTAA CTTTATAAAA GTGTAATATA ACAATGTAAA AAATTATATA TCTGGGAGGA 4043

Cfr^TTTTGGTT GCCTAAAGTG GCTATAGTTA CTGATTTTTT VtTATGTAAG CAAAACCAAT 4103

AATTTAA GTTTTTTTAA CAACTACCTT ATTTTTCACT GTACAGACAC TAATTCATTA 4163
aa-Aactaatt GATTGTTTAA AAGAAATATA AATGTGACAA GTGGACATTA TTTATGTTAA 4223



ATATIACAATT ATCAAGCAAG TATGAAGTTA TTCAATTAAA ATGCCACATT TCTGGTCTCT 4283 



GGGAAAAAAA AAAAAAAA 